# Low Memory Consumption
## Apriel-Nemotron-15b-Thinker GGUF
- License: MIT
- Author: Mungert
- Downloads: 1,097 · Likes: 1
- Tags: Large Language Model, Transformers

Apriel-Nemotron-15b-Thinker is a capable reasoning model that performs strongly among models of its scale. Its efficient memory usage and solid reasoning ability make it suitable for a range of enterprise and academic scenarios.
## Optical Flow MEMFOF Tartan T TSKH
- License: BSD-3-Clause
- Author: egorchistov
- Downloads: 201 · Likes: 2
- Tags: Video Processing, PyTorch, English

MEMFOF is a memory-efficient optical flow estimation method designed for full-HD video, combining high accuracy with low memory usage.
## FLUX.1-dev ControlNet Union Pro 2.0 FP8
- License: Other
- Author: ABDALLALSWAITI
- Downloads: 2,023 · Likes: 15
- Tags: Image Generation, English

This is the FP8-quantized version of the Shakker-Labs/FLUX.1-dev-ControlNet-Union-Pro-2.0 model, converted from the original BFloat16 weights using PyTorch's native FP8 support to improve inference performance.
## DeepSeek-R1-Distill-Qwen-1.5B
- License: MIT
- Author: litert-community
- Downloads: 138 · Likes: 4
- Tags: Large Language Model

Multiple variants of DeepSeek-R1-Distill-Qwen-1.5B adapted for the LiteRT framework and the MediaPipe LLM Inference API, deployable on Android.
## Llama 3.2 3B Instruct Unsloth Bnb 4bit
- Author: unsloth
- Downloads: 240.35k · Likes: 9
- Tags: Large Language Model, Transformers, English

An efficient large language model based on Meta's Llama-3.2-3B-Instruct, optimized with Unsloth's dynamic 4-bit quantization.
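Several of the models in this collection rely on 4-bit weight quantization to cut memory use. The general idea can be sketched with block-wise absmax quantization: each block of weights is scaled by its largest absolute value, then rounded to a signed 4-bit integer. This is a conceptual illustration only, not Unsloth's or bitsandbytes' actual implementation; the block size and `[-7, 7]` code range are assumptions for the sketch.

```python
# Conceptual sketch of block-wise absmax 4-bit quantization, the general
# idea behind 4-bit weight formats. NOT the actual Unsloth/bitsandbytes
# algorithm -- a plain-Python illustration only.

def quantize_4bit(weights, block_size=4):
    """Map floats to signed 4-bit codes in [-7, 7], one absmax scale per block."""
    quantized, scales = [], []
    for i in range(0, len(weights), block_size):
        block = weights[i:i + block_size]
        scale = max(abs(w) for w in block) or 1.0  # avoid divide-by-zero
        scales.append(scale)
        quantized.append([round(w / scale * 7) for w in block])
    return quantized, scales

def dequantize_4bit(quantized, scales):
    """Recover approximate floats from 4-bit codes and per-block scales."""
    out = []
    for block, scale in zip(quantized, scales):
        out.extend(q / 7 * scale for q in block)
    return out

if __name__ == "__main__":
    w = [0.12, -0.5, 0.33, 0.01, 1.5, -0.7, 0.0, 0.25]
    q, s = quantize_4bit(w)
    w_hat = dequantize_4bit(q, s)
    print("max abs error:", max(abs(a - b) for a, b in zip(w, w_hat)))
```

Storing 4-bit codes plus one scale per block is what shrinks a 16-bit model by roughly 4x; the rounding error is bounded by half a quantization step per weight.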
## Universal NER UniNER 7B All Bnb 4bit Smashed
- Author: PrunaAI
- Downloads: 22 · Likes: 1
- Tags: Large Language Model, Transformers

PrunaAI's compressed version of the UniNER-7B-all model. Quantization significantly reduces memory usage and energy consumption while preserving strong named-entity-recognition performance.